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^ ■ Abstract. Given a directed acyclic graph with positive edge-weights, 

Qh| two vertices s and t, and a threshold-weight L, we present a fully- 

■^T . polynomial time approximation-scheme for the problem of counting the 

s-t paths of length at most L. We extend the algorithm for the case of two 
(or more) instances of the same problem. That is, given two graphs that 
have the same vertices and edges and differ only in edge-weights, and 
given two threshold- weights L\ and L2, we show how to approximately 
^0 ■ count the s-t paths that have length at most L\ in the first graph and 

{~*\ • length at most L2 in the second graph. We believe that our algorithms 

should find application in counting approximate solutions of related opti- 
mization problems, where finding an (optimum) solution can be reduced 
to the computation of a shortest path in a purpose-built auxiliary graph. 
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Systematic generation and enumeration of combinatorial objects (such as graphs, 



\Q , set systems, and many more) has been a topic of extensive study in the field of 



combinatorial algorithms for decades |T|. Counting of combinatorial objects has 
(~} [ been investigated at least as thoroughly, even leading to their own computational 

£T) • complexity class =#P, denned in Valiant's seminal paper [2]. A counting problem 

usually asks for the number of solutions to a given combinatorial problem, such 
as the number of perfect matchings in a bipartite graph. In combinatorial op- 
timization, the number of optimum solutions can sometimes be computed by a 
r> . modification of an algorithms for finding a single optimum solution. For instance, 

3 \ for shortest s-t paths in graphs with positive edge weights, Dijkstra's algorithm 

easily admits such a modification. The problem we discuss in this paper has a 
more general flavor: We aim at counting the number of approximate solutions, 
in the sense of solutions whose objective value is within a given threshold from 
optimum. For shortest s-t paths, it is not obvious how to count the number of 
paths within, say, 10% from optimum. A related problem of enumerating fea- 
sible solutions makes a step in this direction: If we can enumerate solutions in 
order of decreasing quality, starting from an optimum solution, we have a way to 
count approximate solutions. Even though for some problems there are known 
enumeration algorithms that return the next feasible solution in the sequence 
of solutions within only polynomial extra time (called "polynomial delay") , this 
approach will usually not be satisfactory in our setting. The reason is that the 



number of approximate solutions can be exponential, and counting by enumer- 
ating then takes exponential time, while our interest is only in the count itself. 

In this paper we propose a way to count approximate solutions for the short- 
est s-t path problem in directed acyclic graphs (DAGs) in polynomial time, but 
the count that we get is only approximate, even though we come as close to the 
exact count as we wish (technically, we propose an FPTAS). We also show that 
exact counting for our problem is NP-hard, thus (together with the FPTAS) fully 
settling its complexity. We achieve our result by a modification of a conceptually 
interesting dynamic program for all feasible solutions for the knapsack problem 
[3J. Our motivation for studying our counting problem comes from a new ap- 
proach |3] to cope with uncertainty in optimization problems. There, we not only 
need to count the number of approximate solutions for a given problem instance, 
but we also need to count the number of solutions that arc approximate (within 
a given approximation ratio) for two problem instances at the same time. For the 
case of shortest s-t paths, this means that we are given two input graphs that 
are structurally identical, but are allowed to differ in their edge weights. We now 
want to count the number of s-t paths that are within, say, 10% from optimum in 
both input graphs at the same time. For this problem we propose both a pseudo- 
polynomial algorithm and an algorithm that calculates an approximate solution 
for a potentially slightly different threshold in fully polynomial time. Our hope is 
that our study paves the way for approximately counting approximate solutions 
for other optimization problems, such as minimum spanning trees. 

The rest of the paper is organized as follows. We talk about possible further 
implications of our result in Section 11.11 We will show that our problem is NP- 
hard in Section 11.21 We present the algorithms in Section [2] and conclude the 
paper in Section [3] 

1.1 Dynamic Programming as Shortest-Path Computation in DAGs 

The concept of computing a shortest s-t path in a directed acyclic graph has a 
large number of applications in many areas of algorithmics. This is due to the 
fact that many, if not most, dynamic programming algorithms can be viewed 
as theproblem of looking for the shortest or longest path in a directed acyclic 
graphQ 

In many problems that admit a dynamic programming solution we are inter- 
ested not only in the single optimum, but also in other approximately optimal 
solutions. For instance, if we single out the context of analysis of biological data, 
de novo peptide sequencing |5|6| . sequence alignment |7 , or Viterbi decoding 
of HMMs |8I9| all use dynamic programming to find a shortest path in some 
implicit graph. Due to the nature of the data in these applications, producing 
a single solution is often insufficient and enumerating all solutions close to the 
optimum is necessary. Our contribution, therefore, provides a faster solution for 
the problems where counting of approximate solutions is necessary [7i . Counting 



1 Note that due to the lack of cycles, the problems of looking for shortest and longest 
paths on DAGs are computationally identical. 



and sampling from close-to-optimum solutions is the key-element of the recent 
optimization method with uncertain input data of Buhmann et al. [3]. Our work 
thus makes a step towards practical algorithms in this context. 

1.2 NP-hardness of Counting Approximate Solutions 

The problem of counting all s-t paths in a directed (or undirected) graph is known 
to be #P-complete [10]. The proof of [TO] makes use of graphs containing cycles. 
Thus, it cannot be directly used to show hardness of our problem. On contrary, 
we consider directed acyclic graphs, and additionally with edge-weights, and 
we are only interested in s-t paths of length at most L. In the following, we 
show that the problem of counting the paths in a directed acyclic graph that 
have length at most some given threshold L is NP-hard, by a reduction from a 
decision version of the partition problem, which is defined as follows. Given a set 
of positive integers S = {s\, . . . , s n }, the partition problem asks whether there 
exists a partition of S into sets S± and S2 such that the sums of numbers in both 
sets are equal. 

Given an instance S — {si, . . . , s n } of the partition problem, we construct a 
graph with n + 1 vertices vi, . . . , w n +i &s follows. We consider the elements of S 
in an arbitrary order s%, . . . ,s n . Then, for every i < n, the graph will contain two 
parallel edges between vertices Vi and u^+i with lengths s^ and — Sj, respectively. 
Then every path from v\ to v n +i corresponds to one partition of S to subsets Si 
and S2. If, between two consecutive vertices Vi and Vi+i, the edge with length 
Si is chosen, Si will belong to the set S\. If the chosen edge has length — Si, 
the element s,; will belong to the set £2. The length of the ?;i-v n+ i path then 
corresponds to the difference between the sums of elements in Si and in S^- 
If there exists a path of length 0, then (and only then) the resepctive sums of 
elements in Si and S2 are equal and the answer to the partition problem is "yes". 

We can answer the partition problem using an algorithm that counts the 
number of s-t paths of length at most L as follows. We first count the number 
of vi-i>„+i paths that have length at most —1, and then we count the number 
of ^i-v„ + i paths that have length at most 0. If (and only if) these two counts 
differ, there exists a path with length and the answer to the partition problem 
is "yes". Otherwise, the answer is "no". 

Since the partition problem is known to be weakly NP-complete [11 , the 
counting problem of s-t paths of length at most L is weakly NP-hard. Note 
that the existence of parallel edges is not necessary for the reduction; we could 
bisect each parallel edge creating an auxiliary vertex to form a graph of the same 
functionality but without parallel edges. Also, observe that the use of negative 
edge-weights is not necessary; we can add to every edge-weight a very large 
number M (say, the maximum number in S) , and then ask whether there exists 
a path of length nM. Thus, we have shown the following. 

Theorem 1. Let G be a directed acyclic graph with integer edge-weights, and L 
be an integer. The problem of counting all s-t paths of length at most L is weakly 
NP-hard, even if all edge-weights are non-negative. 



2 Approximation Algorithms 

In this section we present an FPTAS for our counting problem. That is, given a 
directed acyclic graph G on n vertices, two dedicated vertices s and t, a weight- 
threshold L, and a constant e > 0, we present an algorithm that computes a 
(1 + e)-approximation of the total number of s-t paths of length at most L, and 
which runs in time polynomial in both n and - . 

We first show a recurrence that can be used to exactly count the number of 
s-t paths of length at most L. Evaluating the recurrence takes exponential time, 
but we will later show how to group partial solutions together in such way that 
we trade accuracy for the number of recursive calls. We adapt the approach of 
Stefankovic et al. [3], which they used to approximate the number of all feasible 
solutions to the knapsack problem. 

Let G be a directed acyclic graph with n vertices. We will label the vertices 
v\ 1 . . . ,v n in such order that there is no path from Vi to Vj unless i < j, i.e. 
vi, . . . ,v n defines a topological ordering. We suppose that v\ — s and v n = t, 
otherwise the graph can be pruned by discarding all vertices that appear before 
s and after t in the topological order, since no path from s to t ever visits these. 

For a concrete vertex Vi with in-degree d, let us denote its d neighbors that 
precede it in the topological order by Pi,i, ■ ■ ■ ,Pi,d and let us denote the corre- 
sponding incoming edge lengths by Zj,i, . . . , Zj,<z. We will usually drop the index 
i to simplify the notation. Instead of asking for the number of s-t paths that are 
shorter than L for a given L, we indirectly ask for smallest threshold L, such 
that there are at least a paths from s to t, shorter than L. Let r{vi, a) denote the 
minimum length L such that there are at least a paths from v\ to Vi of length 
at most L. r(vi,a) can be computed by the recurrence 

t(vi,0) = — oo 

t(vi,o) =0,Va:0 <a < 1 

r(wi,a) = oo,Va : a > 1 

{T(p 1 ,a 1 a) +h 
'■ 
■ 
r(p d ,a d a) + l d 

Intuitively, the at least a paths starting at v\ and arriving at Vi must split 
in some way among incoming edges. The values olj define such split. We look 
for a set of a\, . . . , a d that minimizes the maximum allowed path length needed 
such that the incoming paths can be distributed according to aj, j — 1, . . . , d. 
Note that while the values of aid do not have to be integer, r(vi,aia) is equal 
to r(vi, \aia\). Moreover, when evaluating the recursion, it is enough to search 
for values on such that each of the values a\a, . . . , a d a is integer. 

To find the number of paths of length at most L, we search for a such that 
T{v n ,a) < L < r{v n ,a + 1), and return it as the output. In particular, if the 
length of the shortest s-t path is OPT (which can be computed in polynomial 
time), we can find, for any p > 1, the number of p-approximate s-t paths by 
setting L := pOPT. 



Calculating r using the given recurrence will not result in a polynomial time 
algorithm since we might need to consider an exponential number of values for 
a, namely 2"~ 2 on a DAG with a maximal number of edgeso To overcome this, 
we will consider only a polynomial number of possible values for a, and always 
round down to the closest previously considered one in the recursive evaluation. 
If we are looking for an algorithm that counts with 1 + e precision, the ratio 
between two successive considered values of a must be at most 1 + e. 

For this purpose, we introduce a new function r'. In order to achieve precision 
of 1 + e, we will only consider values of r' for minimum path numbers in the 
form of q k for all positive integers k such that q k < 2™ -2 , where q = "~\/l + e. 
The values of t' for other numbers of paths will be undefined. The function r' 
is defined by the recurrence 



t'(vi,Q) = — oo 

t'(vi,o) = 0,Va : < a< 1 

r'(ui,a) = oo,Va : a > 1 

(V(pi,^ +log * ai J) + Ji 
T , (vi,q :l )= min max < : . (1) 

eV=i [ T >(pf : gU+iog, °*j ^ + ld 

To give a meaning to the expression qLi+iog, Q ;J wn en on = 0, we define it to be 
equal to 0, which is consistent with its limit when a,i goes to 0. We want to show 
that the rounding does not make the values of r' too different from the values of 
r. Ideally, we would want to get bounds like t^,^ -1 ) < r'{vi,q 3 ) < T(vi,q 3 ), 
but we will see that it is sufficient to show that T(vi,qi~ l ) < r'(vi, qi) < r(vi, q^). 

Lemma 1. Let 1 < i and i < j. Then 

T( Wi ,^- l )<T'(t.. i ,q^)<T( Wi ,^'). (2) 



Proof. We first prove the first inequality, proceeding by induction on i. The base 
case holds since r(vi,a) < r'(vi,b) for any a < b. Suppose now that the first 
inequality of @ holds for every p, p < i. Then, for every < a < 1, 

r'(p,g Lj ' +log * aJ ) > r(p,gLj + i°g,«J-P) 
> T(p,q j - p - 1+loe « a ) > T{p,aqi- 1 ). 

Thus, since every predecessor of w,; is earlier in the vertex ordering, we can 
use the obtained inequality to get the claimed bound 



2 To see this, observe that in a topologically sorted graph G, any subset of V \ {s, t) 
gives a unique candidate for an s-t path. 



-'(pi,gLj+i°g,«iJ) + / 1 



t' (vj ,q J ) = min max 

v ' ai,...,a d 



{T(pi,aig J "" , *)+i!i 
■ 

= T{v i ,q j - i ). 

The other inequality T'(vi : q>) < r(vi,qi) follows by a simpler induction on 
i. The base case holds since t{vi,x) = t'(v\,x) for all x. Assume now that the 
second part of @ holds for all p < i. Then 

T'(p,gLj"+i°g,«.J) < T (p,gLj+i°g,«.J) < T {p,on^). 

We can now use the recursive definition to obtain the claimed inequality 
r'ivuq ) < r(w J ,g J ): 



T'( Pl ,q^ +lo ^ a ^) + h 



T'{vi,q r ) 



mm max 

u 1 ,...,a d 



£°y=i {T'(p d ,q^ +lo ^ ad ,i) + l d 

{r(p\,a\q j ) + Zi 
T(pd,adiq j ) +ld 

= TiVi^ 1 ). 

□ 

The next theorem shows how to use r' to produce a (1 + e)-approximation 
for the counting problem. We need to show that for any L, if we find k such that 
T~'(v n ,q k ) < L < r'(v n , q k+1 ), the value q k will be no more than (1 + £) ±:L away 
from the value a for which r(v n , a) < L < r(u n , a). 

Theorem 2. Given L, let k be such that T'(v n ,q k ) < L < r'(v n , q k+1 ) and a be 
such that r{v n ,a) < L < r{v n ,a + 1). Then (1 + e)^ 1 < -% < 1 +s. 

Proof. Applying Lemma Q] twice, we get T(v n ,q k ~ n ) < T'(v n ,q k ) < L < 
T'(v n ,q k+1 ) < r(v n ,q k+1 ). As T(v n ,q k ~ n ) is at most L, and a is largest such 
that t(v„, a) < L, and r is monotonous in its second parameter, it must be that 
q k ~ n < a. Similarly, T(u„,<Z fc+1 ) is larger than L, so by monotonicity a < q k+1 . 
Thus both a and q k must lie between q k ~ n and q fe+1 and their ratio can be at 
most g fc +!-( fc -») = q "+! = 1 + e and at least q k ~i k + x ) = (1 + e )-V(«+i) > 
(1 + e)- 1 . □ 



It is left to be shown that the we can compute all values of the function r' 
in polynomial time. 

Theorem 3. Given a largest allowed path-length L, there is an algorithm that 
finds k satisfying T'(v n ,q k ) < L < t' (v ni q k+l ), and runs in time 0(mn 2 e~ 1 log n) , 
where m denotes the number of edges in the graph. 

Proof. As noted before, directed acyclic graph on n vertices has at most 2™~ 2 
paths between any two vertices. The values of a in r therefore span at most 
{1,2,..., 2™~ 2 }, and the values of q k in t' span at most {1, q, q 2 , . . . , q s }, where 

, , 9 n-2s ("~2) (rc-2)(n+l) 2 _ 

s := log„(2 ) = = — ■ — = U(n e ). 

q log 29 Ioga(l + e) 

Thus, we evaluate the function r' for at most ns = 0(n 3 e~ 1 ) different parameter 
pairs. 

To show that the evaluation of t' can be done in polynomial time, we need 
to show that we can efficiently find a±, . . . ,otd that minimize Expression ([TJ. 
Fortunately, r'(vi,q k ) is monotonous with increasing k, we can thus apply a 
greedy approach. Given Vi, we will evaluate r'(vi,q k ) for all possible values of 
q k in one run. Instead of the tuple a\ . . . a<j we will consider an integer tuple 
k\ . . . kd- We start with all ki equal to and always increase the ki that minimizes 
r'(pi, q ki+1 ) + U by one. Whenever the sum of all q k increases over some value 
q k , we store the current maximum of r'(pi,q ki ) + k as the value r'(vi,q k ). We 
terminate once ^\ q ki reaches 2 n ~ 2 . As we can increase each ki at most s times, 
we make at most ds steps, each of which involves choosing a minimum from 
d values and replacing it with a new value. The latter can be done in time 
0(\ogd) C O(logn), for instance by keeping the values r'(vi, q ki+1 ) + l i in a 
heap. The sum of d for all vertices is equal to the number of edges m. The time 
necessary to find all the values of r'(v n , qi) is thus Oimrfe -1 logn). □ 

2.1 FPTAS for solutions that are approximate in two instances 

In this section we consider the problem of counting solutions that are approximately- 
optimum for two given instances at the same time. The two instances differ in 
edge lengths, but share the same topology, |f| effectively giving each edge two dif- 
ferent lengths. Formally, given two directed acyclic graphs G\ and G^. differing 
only in edge- weights, given two vertices s and t, and given two threshold values 
L\ and L2, we are interested in the number of the s-t paths that have at the 
same time length at most L\ in G\ and length at most L2 in Gi- 

To solve this algorithmic problem, we cannot directly apply the approach for 
the single-instance case (by defining r to be a pair of path lengths, one for each 



3 Another way of looking at this is that if we have a bijection that tells us which vertex 
in the first instance corresponds to which vertex in the second instance, we can drop 
all edges that exist in only one of them, as any path using this will not approximate 
the optimal solution in the other instance. 



of the two instances) , as we now have two lengths per edge and it is unclear how 
to suitably define a maximum over pairs in Equation (JTJ) . We will nonetheless 
follow a similar approach and define a function r 2 similar to r that adds one of 
the path lengths in a form of a "budget" as a parameter of the function. 

The function Ti{vi,a, L\) is defined to be the smallest length L 2 such that 
there are at least a paths from V\ to Vi, no longer than L\ with respect to the 
edge lengths in the first instance, and no longer than L 2 with respect to the edge 
length in the second instance. Consider a vertex vi of in-degree d. As before, we 
denote the edge lengths of the d incoming edges of vertex Vi in the first instance 
by li . i = 1 , . . . , d. Similarly, we denote the edge lengths of the d incoming edges 
of vertex v, in the second instance by l[, i = 1, . . . , d. Then, r 2 can equivalently 
be defined by the following recursion: 



p^ 



T2(Vl,0, a;) = — 00, Vx £ K + 
Ti{vi,a, x) = 0,Vo : < o < l,Vx £ 
T2(vi,a,x) = 00, Va : a > l,Vx £ M. + 

{T 2 {pi,aia,Li - h) + l[ 
'■ 
Ti{p d ,a d a,Lx-l d ) + l' d 

If we wanted to use r 2 to directly use to solve our counting problem, the 
function t 2 would have to be evaluated not only for an exponential number of 
path counts a, but also for possibly exponential number of values of L\. To 
end up with polynomial runtime, we thus need to consider only a polynomial 
number of values for both parameters of t 2 . For this purpose, we will introduce 
a function t 2 that does this by considering only path lengths in the form of r k , 
where r = yl + d, and path numbers a in the form of q 3 , where q = yl + e, 
for positive e and S. Function t 2 is defined by the following recurrence: 

T 2 (t>i, 0, x) = — 00, Va; £ M + 
T^ Vl ,a,x) = 0,Va : < a < l,Vai £ K+ 
T2(vi,a, x) = 00, Va : a > l,Va; £ K + 

r 2 (pi,g Lj+Iog ' QlJ ,rL 1 °g.('' fe - i i)J) + /' 1 
t% (vi , q- 1 , r ) = min max ■ 



a 1 ,...,a d 



Similarly to the case of one instance only, we show that r 2 approximates r 2 
well, this time in two variables. 

Lemma 2. Let <i, i < j, and i < k. Then 

T 2 (v u q>-\r k ) < r' 2 (v tl q^r k ) < T 2 (vi, q j ,r k ~ i ). (3) 



Proof. We proceed as in the proof of Lemma Q] Note that the function r 2 is 
monotone non-decreasing in a, but monotone non-increasing in L\. Proceeding 
by induction on i, the base case holds since 72 (vi, a, y) < r 2 (vi,b,y) for any 
a < b and y. We suppose that Equation © holds for all p < i. Then, for every 
< a < 1, 

T 2 (p,g Lj+los ' Qj ,rL lo s^ rfc -')J)a>r 2 (p, g Lj+i°g,«J-P jr Liog.(r fc -0J) 
> T 2 ( P ,qJ-r- 1+l °^ a ,r k -l)> T 2 (p,aq^\r k - I). 

Thus, since every predecessor of Vi has index smaller than i, 



T!)(vi,q J , r ') — min max 
2y ' y ' ' ai,...,a d 



T. 



2 (p d ,q U+loe * adi , A lo ^(r k -i d )i ) + i' Q 



To 



(p 1 ,a 1 q^\r k -h) + l' 1 



> min max 

= T 2 (v t ,q ] -\r k ). 

The proof of the inequality r 2 (vi : qi ,r k ) < t 2 (v{, g- 7 , r fc ~'') is similar. Assuming 
that (J3J) holds for every p < i, we obtain 



t' (p, glJ+ lo g, a i ( r Liog r (r *-0J ) < T2 ( Pj gb'+log, "J ( j-LlogrO- -OJ-P) 
< r 2 b,a^',r log ^ r ' = -^- p - 1 ) < T 2 (p,aq\r k ^ - Z). 
Plugging it into the definition of t 2 , we obtain 

r 2 (Pi>9 lj+los ' jQl V Llos '' (rfc ~' l)J ) + ^i 



T 2 ( Vl ,q\r k ) 



mm max 

Qi,...,Q d 



T. 



2 (Pd,q U+l ° s " * di , A l °Zr(r k -i d )\ ) + i' d 



(T2{px,a 1 qi,r k - 1 -h) + l' l 
< min max < : 

E"'i™ [ r 2 (p d , cW, r fc - x - l d ) + l' d 

= T 2 (v l ,q 3 ,r k - 1 ). 







Using Lemma[2] we can show that r 2 provides enough information to compute 
an approximation of r 2 . However, we cannot get a (1 + e) approximation to the 
optimal value as in the Theorem [5] because we need to round the value of L\ to 
a power of r in order for it to be legal parameter of t 2 and we further round it 
during the evaluation of t 2 . We will therefore relate the result of r 2 to the results 



of t 2 we would have gotten if we considered the value of L\ when rounded up 
towards the nearest number that can be represented as r k for integer k and the 
value r k ~ n . Due to the choice of r, the ratio of these two values is 1 + 8. 

Theorem 4. Given desired maximum v\-v n path lengths L\ and L 2 with respect 
to two given instances, let integer k be such that T 2 (v n , q k , r' l ° 6r Ll ') < L 2 < 
T 2 {v n ,q k+1 ,r^°^ L ^), a be such that T 2 (v n ,a,r^°^ L ^- n ) < L 2 < T 2 (v n ,a + 
l ^ r lio Sr L 1 ]-n^ and b be i argest such that T2 ^ n ^ ;r riog r Lil) < L 2 < T 2 (v n ,a + 

l jr n°SrLi]y Then a < b, ■+ < 1 + e, and £ < 1 + e. 

Proof. The statement that a < b follows from the definition of a and b: decreasing 
the limit on the path length in the first instance from r^ log *- Ll ^ to r^ loe <- Ll ~\~ n 
cannot increase the number of possible paths. By applying Lemma [5] twice, we 
get 

r 2 K, g fc -",rn°g^il) <T 2 (v n ,q k ,r^Li}) < £ 2; ( 4 ) 

and 

L 2 < T 2 (v n ,q k+ \r^ L ^) < T 2 {v n ,q k+ \r^°^ L ^- n ). (5) 

From the definition of a and (0 we can conclude a < q k+1 . This implies that 
-% < q < 1 + e, due to our choice of q. Similarly from the definition of b and ([J} 

we get b > q k ~ n and thus ^- < q n < 1 + e. D 

Theorem |4] shows that the computed number of s-t paths q k cannot be larger 
than b by more than a factor of 1+e, nor can it be smaller than a by a factor larger 
than 1+e. We can now state the overall running time of the approach. Compared 
to the function r' we need to evaluate r^ for [log r L{\ — 0(n6~ 1 log Li) values of 
r , in addition to the values of Vi and q . Otherwise the arguments are identical 
to the proof of Theorem [3] Note that logLi is by definition in 0(n), but we list 
it explicitly since it can be much smaller in practice. 

Theorem 5. Given largest path-lengths L\ and L 2 for two given instances, there 
is an algorithm that finds k satisfying T 2 (v n , q k , r^ OSr Ll ^ ) < L < r 2 (u„, q k+1 , r^ loSr Ll ^), 
and runs in time 0(mn 3 e~ 1 6~ 1 \ognlogLi), where m denotes the number of 
edges in the graph. 

It is easy to see that we can extend the approach to count paths that ap- 
proximate m instances at the same time by adding "budgets" L\, . . . , L m -\ for 
the desired maximal lengths of paths in instances 1, 2, . . . , m — 1. The time com- 
plexity would again increase, for every additional instance with threshold Li by 
0{n6~ x log Li). 

2.2 Pseudo-polynomial algorithm for two instances 

If the discrepancy between a and b as defined in the Theorem [5] is too large and 
all edges have integer lengths, we can consider all possible lengths in the first 
instance, instead of rounding to values in the form of r k . The function r 2 will 
be t' extended with exact maximum length of path in the first instance. 



t£>i,0,x) = -oo,Va;eM + 
T%( Vll a,x) = 0,Va: 0< a< l,Vx€ R+ 
^'(ui, a, x) = oo, Va :o> l,Vs€ R + 

^'bi,g b ' +los ' QlJ ,i-/i) + /i 



' ' (vj.q 3 ,r ) = min max 

V ' y ' ' oti,...,a d 



'(Pd,g Lj+los ' QdJ ,i-?d) + ^ 



We will state the two theorems about accuracy and runtime without proofs, 
since these are similar to the proofs of theorems [2J and [3] 

Theorem 6. Given L, let k be such that ^'(vn, q k , L\) < L2 < T2(v n , q +1 , L\) 



and a be such that T2(v n , a, L\) < L2 < T2(v n , a + 1, L\). Then (1 + e) < -%■ < 



Theorem 7. Given maximum allowed path lengths L\ and L2 in two instances, 
if all edges have integer lengths, we can find k which satisfies T2(v n ,q k , L\) < 
L2 < Tz (v n , q , Li) in time 0(mn 2 e~ 1 Li\ogn), where m denotes the number 
of edges in the graph. 

Since the running time is linear in L\, the algorithm that corresponds to the 
evaluation of the function r'-l is pseudo-polynomial. 

3 Concluding Remarks 

In this paper we showed that there is an efficient algorithm to approximate the 
number of approximately shortest paths in a directed acyclic graph. This prob- 
lem is implicitly or explicitly present as an algorithmic technique in algorithmic 
solutions to a large number of different computational problems, not limited to 
the evaluation of solutions achieved by dynamic programming which we noted 
in Section ITTTI 

Our result allows us, for instance, to approximately count only the small (or 
large) terms of a polynomial p(x) = ^2 i aiX l , a, > 0, represented as a product 
riiPjC 3 -) °f polynomially many polynomial factors Pj(x), where each Pj(x) = 
Y)u bkX k has polynomially many terms, and where every bk > 0. This is especially 
interesting if the full expansion of p(x) has exponentially many terms. This 
may be a powerful tool, if extended to the case of both negative and positive 
bk, enabling the counting of approximate solutions for problems with known 
generating polynomials of solutions by weight. For instance, counting of large 
graph matchings [12] or short spanning trees [T3] can be done via generating 
polynomials (which, in general, have exponentially many terms). This direction 
is our primary future work. 

We have also showed that our algorithm can be extended, given threshold 
weights Lx,...,L m , and polynomially many graphs Gi,...,G m , to count s-t 



paths that have, at the same time, length at most Li in Gi, i = 1, . . . ,m. In the 
case when m = 2, this algorithm is necessary for application of the aforemen- 
tioned robust optimization method [4 to the various mentioned optimization 
problems. 
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